Bambus 2: scaffolding metagenomes

نویسندگان

  • Sergey Koren
  • Todd J. Treangen
  • Mihai Pop
چکیده

MOTIVATION Sequencing projects increasingly target samples from non-clonal sources. In particular, metagenomics has enabled scientists to begin to characterize the structure of microbial communities. The software tools developed for assembling and analyzing sequencing data for clonal organisms are, however, unable to adequately process data derived from non-clonal sources. RESULTS We present a new scaffolder, Bambus 2, to address some of the challenges encountered when analyzing metagenomes. Our approach relies on a combination of a novel method for detecting genomic repeats and algorithms that analyze assembly graphs to identify biologically meaningful genomic variants. We compare our software to current assemblers using simulated and real data. We demonstrate that the repeat detection algorithms have higher sensitivity than current approaches without sacrificing specificity. In metagenomic datasets, the scaffolder avoids false joins between distantly related organisms while obtaining long-range contiguity. Bambus 2 represents a first step toward automated metagenomic assembly. AVAILABILITY Bambus 2 is open source and available from http://amos.sf.net. CONTACT [email protected]. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical scaffolding with Bambus.

The output of a genome assembler generally comprises a collection of contiguous DNA sequences (contigs) whose relative placement along the genome is not defined. A procedure called scaffolding is commonly used to order and orient these contigs using paired read information. This ordering of contigs is an essential step when finishing and analyzing the data from a whole-genome shotgun project. M...

متن کامل

[A new technique for the study of the mosquito (Diptera: Culicidae) fauna in bamboo internodes, with preliminary results].

Laboratório de Entomologia Médica, Departamento de Microbiologia e Parasitologia do Centro de Ciências Biológicas da Universidade Federal de Santa Catarina, Florianópolis, SC, Brasil. Recebido para publicação em 11/8/2003 Aceito em 1/9/2003 Os bambus e taquaras podem fornecer criadouros para mosquitos de várias espécies, principalmente de Sabethini , e sua fauna tem sido estudada em vários cont...

متن کامل

DFT Study on the Complexation of Bambus[6]uril with the Perchlorate and Tetrafluoroborate Anions.

By using quantum mechanical DFT calculations, the most probable structures of the bambus[6]uril.ClO4- and bambus[6]uril.BF4- anionic complex species were derived. In these two complexes having C3 symmetry, each of the considered anions, included in the macrocyclic cavity, is bound by 12 weak hydrogen bonds between methine hydrogen atoms on the convex face of glycoluril units and the respective ...

متن کامل

Bambus[6]uril as a novel macrocyclic receptor for the nitrate anion.

By using quantum mechanical DFT calculations, the most probable structure of the bambus[6]uril x NO3(-) anionic complex species was derived. In this complex having C3 symmetry, the nitrate anion NO3(-), included in the macrocyclic cavity, is bound by twelve weak hydrogen bonds between methine hydrogen atoms on the convex face of glycoluril units and the considered NO3(-) ion.

متن کامل

Better Identification of Repeats in Metagenomic Scaffolding

Genomic repeats are the most important challenge in genomic assembly. While for single genomes the effect of repeats is largely addressed by modern long-read sequencing technologies, in metagenomic data intra-genome and, more importantly, inter-genome repeats continue to be a significant impediment to effective genome reconstruction. Detecting repeats in metagenomic samples is complicated by ch...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 27 21  شماره 

صفحات  -

تاریخ انتشار 2011